Stability Selection for Structured Variable Selection
نویسندگان
چکیده
In variable or graph selection problems, finding a right-sized model or controlling the number of false positives is notoriously difficult. Recently, a meta-algorithm called Stability Selection was proposed that can provide reliable finite-sample control of the number of false positives. Its benefits were demonstrated when used in conjunction with the lasso and orthogonal matching pursuit algorithms. In this paper, we investigate the applicability of stability selection to structured selection algorithms: the group lasso and the structured input-output lasso. We find that using stability selection often increases the power of both algorithms, but that the presence of complex structure reduces the reliability of error control under stability selection. We give strategies for setting tuning parameters to obtain a good model size under stability selection, and highlight its strengths and weaknesses compared to competing methods screen and clean and cross-validation. We give guidelines about when to use which error control method.
منابع مشابه
Finding stability regions for preserving efficiency classification of variable returns to scale technology in data envelopment analysis
This paper addresses issue of sensitivity of efficiency classification of variable returns to scale (VRS) technology for enhancing the credibility of data envelopment analysis (DEA) results in practical applications when an additional decision making unit (DMU) needs to be added to the set being considered. It also develops a structured approach to assisting practitioners in making an appropria...
متن کاملEvolutionary Stability in One-Parameter Models under Weak Selection
A general notion of evolutionary stability is formulated in models in which the possible behaviours are parameterized by a continuous variable, and selection is assumed to be weak. Two local stability conditions are formulated, m-stability and &stability, the former being first-order and the latter second-order in the mutant behavioural deviation. The conditions are interpreted in two standard ...
متن کاملStability selection
Estimation of structure, such as in variable selection, graphical modelling or cluster analysis is notoriously difficult, especially for high-dimensional data. We introduce stability selection. It is based on subsampling in combination with (high-dimensional) selection algorithms. As such, the method is extremely general and has a very wide range of applicability. Stability selection provides f...
متن کاملGenetic worth and stability of selection indices in rice (Oryza sativa L.)
Improvement of one trait on its own will affect the performance of other traits because ofgenotypic correlations between traits. Index selection is one of the tools used by plant breedersto overcome this problem. The purpose of this paper is to evaluate selection indices developedfor improving grain yield in rice (Oryza sativa L.). Forty-nine rice genotypes were cultivated atTonekabon Rice Rese...
متن کاملSelection of suitable reference genes for real-time PCR studies of early developmental stages of sturgeons
In quantitative real-time PCR, the mRNA level can be quantified in relative terms based on the expression ratio of mRNAs of the target gene and an internal reference gene. Since, an internal standard should be expressed at a constant level among different tissues of an organism at all stages of development, and should be unaffected by the experimental treatment, the stability of different refer...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1712.04688 شماره
صفحات -
تاریخ انتشار 2017